MUESLI: multiple utterance error correction for a spoken language interface

نویسندگان

  • Federico Cesari
  • Horacio Franco
  • Gregory K. Myers
  • Harry Bratt
چکیده

We propose a method for using all available information to help correct recognition errors in tasks that use constrained grammars of the kind used in the domain of Command and Control (CC) systems. In current spoken language CC systems, if there is a recognition error, the user repeats the same phrase multiple times until a correct recognition is achieved. This interaction can be frustrating for the user, especially at high levels of ambient noise. We aim to improve the accuracy of the error correction process by using all the previous information available at a given point, this being the previous utterances of the same input phrase and the knowledge that the previous result contained an error.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining Linguistic and Statistical Technology for Improved Spoken Language Understanding

SRI has developed a spoken language interface to the Official Airline Guide (OAG). Despite a funding gap for more than four months of the year, substantial improvements have been made in the component technologies. On recent ARPA benchmarks. SRI achieved 5.5% word error on the ATIS speech recognition task, 18.2% utterance error on the natural-language understanding task, and 20.7% utterance err...

متن کامل

Identifying local corrections in human-computer dialogue

Miscommunication in human-computer interaction is unavoidable, although speech recognition accuracy continues to improve. The perceived difficulty of correcting miscommunications has an even larger negative impact on assessments of system quality than does the absolute error rate. Therefore it is essential to improve error resolution capabilities in spoken language systems. While prior research...

متن کامل

Detection and recognition of correction utterance in spontaneously spoken dialog

Recently, the performance of speech recognition was drastically improved, and the products with the interface based on speech recognition have been realized. However, when we communicate with computers through a speech interface, misrecognition is inevitable, and it is difficult to recover from it because of the immaturity of the interface. Users try to recover from misrecognition by a repetiti...

متن کامل

Online Error Detection of Barge-In Utterances by Using Individual Users' Utterance Histories in Spoken Dialogue System

We develop a method to detect erroneous interpretation results of user utterances by exploiting utterance histories of individual users in spoken dialogue systems that were deployed for the general public and repeatedly utilized. More specifically, we classify barge-in utterances into correctly and erroneously interpreted ones by using features of individual users’ utterance histories such as t...

متن کامل

An Acoustic Study of Emotivity-Prosody Interface in Persian Speech Using the Tilt Model

This paper aims to explore some acoustic properties (i.e. duration and pitch amplitude of speech) associated with three different emotions: anger, sadness and joy against neutrality as a reference point, all being intentionally expressed by six Persian speakers. The primary purpose of this study is to find out if there is any correspondence between the given emotions and prosody patterning in P...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008